Every document has a geographical scope
Abstract
It is a useful premise that every document in a collection and every query issued to an information retrieval (IR) system is geography-dependent. If one can determine what area an article is about (i.e., its geographical scope), this information can be used to improve the accuracy with which people, places and organizations named in the article are located. More importantly, the geographical scopes of documents may be exploited to improve the performance of IR systems on geography-dependent user queries by tuning relevance ranking and query expansion strategies with scope metadata. To ascertain the usefulness of geographical information in improving retrieval accuracy, we address two questions: (1) how far can geographical information in queries and documents improve the retrieval accuracy of IR systems when answering geography-dependent queries; and (2) how effectively can geographical information in queries and documents be used to improve the quality of relevance ranking in the geographical IR domain. This paper outlines strategies to determine the geographical scope of documents, and describes methods that use scope information to improve the performance of toponym resolution, relevance ranking and query expansion.
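As a rough illustration of how a document's scope might feed back into toponym resolution, the following minimal sketch infers a country-level scope by voting over the place names in a document and then prefers the reading of an ambiguous name that falls inside that scope. The gazetteer, the function names `infer_scope` and `resolve`, and the voting scheme are all assumptions made for this example; the abstract does not specify the paper's actual method.

```python
from collections import Counter

# Hypothetical toy gazetteer (an assumption for this sketch):
# toponym -> list of candidate readings as (place, country code).
GAZETTEER = {
    "Paris": [("Paris, France", "FR"), ("Paris, Texas", "US")],
    "Lyon": [("Lyon, France", "FR")],
    "Marseille": [("Marseille, France", "FR")],
    "Dallas": [("Dallas, Texas", "US")],
}


def infer_scope(toponyms):
    """Return the country code with the most votes, or None if nothing is known."""
    votes = Counter()
    for name in toponyms:
        candidates = GAZETTEER.get(name, [])
        # Split each toponym's single vote evenly over its candidate readings.
        for _, country in candidates:
            votes[country] += 1.0 / len(candidates)
    if not votes:
        return None
    return votes.most_common(1)[0][0]


def resolve(name, scope):
    """Prefer the candidate reading inside the scope; else fall back to the first."""
    candidates = GAZETTEER.get(name, [])
    for place, country in candidates:
        if country == scope:
            return place
    return candidates[0][0] if candidates else None


if __name__ == "__main__":
    doc_toponyms = ["Paris", "Lyon", "Marseille"]
    scope = infer_scope(doc_toponyms)
    print(scope, resolve("Paris", scope))
```

A real system would likely use a finer-grained scope hierarchy (region, country, continent) and weight evidence by frequency or position in the text; the flat country vote here is only the simplest case of the idea.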
Similar resources
Assigning Geographical Scopes To Web Pages
Finding automatic ways of attaching geographical scopes to on-line resources, also called “geo-referencing” documents, is a challenging problem, getting increasing attention [1, 5, 3]. Here we present a system architecture and a process for identifying the geographical scope of Web pages, defining a scope as the region where more people than average would find that page relevant. We rely on typ...
Document Clustering Based on Ontology and a Fuzzy Approach
Data mining, also known as knowledge discovery in databases, is the process of discovering unknown knowledge from a large amount of data. Text mining applies data mining techniques to extract knowledge from unstructured text. Text clustering, the unsupervised classification of similar documents into different groups, is one of the important techniques of text mining. The most important step...
Experiments Adapting An Open-Domain Question Answering System To the Geographical Domain Using Scope-Based Resources
This paper describes an approach to adapt an existing multilingual Open-Domain Question Answering (ODQA) system for factoid questions to a Restricted Domain, the Geographical Domain. The adaptation of this ODQA system involved the modification of some components of our system such as: Question Processing, Passage Retrieval and Answer Extraction. The new system uses external resources like GNS G...
An Inquiry into the Theoretical Understanding of the Concept of Cultural Geography within the Framework of the Constructivist School
In the human sciences, a single concept may have several definitions, and these accounts may even contradict one another across different philosophical schools. Explaining a concept within different schools of thought is therefore of great importance. Geographical space consists of a bilateral relation between humans and the environment. In other words, this dimension and its ideological s...
An Inquiry into Understanding the Spatial Reflection of the Performance of Political Actors within the Framework of the Hermeneutic Phenomenology School
Extended abstract. Introduction. In the human sciences, a single concept may have several definitions, and these accounts may even contradict one another across different philosophical schools. Explaining a concept or relationship within different schools of thought is therefore of great importance. From a philosophical perspective in the human sciences, theoretical structure has very fundamen...
Journal title:
- Data Knowl. Eng.
Volume 81-82 Issue
Pages -
Publication date 2012